Creating speaker-specific phonetic templates with a speaker-independent phonetic recognizer: implications for voice dialing
نویسندگان
چکیده
We present a new approach to speaker dependent template generation which uses dramatically less storage to represent a speaker's words, with minimal degradation in recognition accuracy. In this approach, the symbolic string produced by a speaker-independent phonetic recognizer is used to represent utterances. We investigate eeective procedures for template generation, and compare the results of these procedures to templates represented by acoustic parameters for utterances produced with diierent telephone handsets. The use of speaker-speciic templates led to a reduction of about 1:500 in data-storage requirements with comparable recognition accuracy. In also compare recognition performance for speaker-speciic and speaker-independent templates , and for combinations of the two. The results showed that combining speaker-speciic and speaker-independent templates produces better recognition performance than either alone. A voice dialing system is described which incorporates the speaker-speciic templates.
منابع مشابه
Phonetic, idiolectal and acoustic speaker recognition
This paper describes a text-independent speaker recognition system that achieves an equal error rate of less than 1% by combining phonetic, idiolect, and acoustic features. The phonetic system is a novel language-independent speakerrecognition system based on differences among speakers in dynamic realization of phonetic features (i.e., pronunciation), rather than spectral differences in voice q...
متن کاملTechniques for robust speech recognition in the car environment
The use of voice commands or navigation features in the car is becoming a necessity. As keyboard and display interfaces cannot be used safely while driving, much effort has been done to make automatic speech recognition (ASR) and Text-to-Speech synthesis (TTS) ubiquitous features in the car. From voice dialing to car navigation, the requirements for voice technology vary greatly. While the use ...
متن کاملSpeaker-dependent Speech Recognition Based on Phone-like Units Models | Application to Voice Dialing
This paper presents a speaker dependent speech recognition with application to voice dialing. This work has been developed under the constraints imposed by voice dialing applications, i.e., low memory requirements and limited training material. Two methods for producing speaker dependent word baseforms based on Phone Like Units (PLU) are presented and compared : (1) a classical vector quantizer...
متن کاملVoice morphing and the manipulation of intra-speaker and cross-speaker phonetic variation to create foreign accent continua: a perceptual study
The STRAIGHT system of voice morphing was used to create voice continua of (Korean) accented Australian English, intended to simulate phonetic variation ranging from ‘heavily accented’ to ‘unaccented’ (native-like) Australian English, employing dimensions of intra-speaker and cross-speaker variation to yield a range of synthetic voices. These synthetic voices were evaluated against actual sampl...
متن کاملSpeaker-dependent speech recognition based on phone-like units models-application to voice dialling
This paper presents a speaker dependent speech recognition with application to voice dialing This work has been devel oped under the constraints imposed by voice dialing appli cations i e low memory requirements and limited training material Two methods for producing speaker dependent word baseforms based on Phone Like Units PLU are pre sented and compared a classical vector quantizer is used t...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1996